Synthetic Humans for Action Recognition from Unseen Viewpoints

Abstract

Although synthetic training data has been shown to be beneficial for tasks such as human pose estimation, its use for RGB human action recognition is relatively unexplored. Our goal in this work is to answer the question whether synthetic humans can improve the performance of human action recognition, with a particular focus on generalization to unseen viewpoints. We make use of the recent advances in monocular 3D human body reconstruction from real action sequences to automatically render synthetic training videos for the action labels. We make the following contributions: (1) we investigate the extent of variations and augmentations that are beneficial to improving performance at new viewpoints. We consider changes in body shape and clothing for individuals, as well as more action-relevant augmentations such as non-uniform frame sampling, and interpolating between the motion of individuals performing the same action; (2) we introduce a new data generation methodology, SURREACT, that allows training of spatio-temporal CNNs for action classification; (3) we substantially improve the state-of-the-art action recognition performance on the NTU RGB+D and UESTC standard human action multi-view benchmarks; finally, (4) we extend the augmentation approach to in-the-wild videos from a subset of the Kinetics dataset to investigate the case when only one-shot training data is available, and demonstrate improvements in this case as well.
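The abstract names non-uniform frame sampling as one of the action-relevant augmentations but does not describe how it is done. The following is only a minimal sketch of one plausible reading, in which the gap between consecutive sampled frames is drawn at random; the function name `nonuniform_frame_indices` and the parameters `max_step` and `seed` are assumptions made for illustration and are not taken from the paper or from SURREACT.

```python
import random
from itertools import accumulate


def nonuniform_frame_indices(num_frames, num_samples, max_step=4, seed=None):
    """Return `num_samples` strictly increasing frame indices.

    Hypothetical helper: the gap between consecutive indices is drawn
    uniformly from 1..max_step, so the clip is traversed at a varying speed.
    """
    rng = random.Random(seed)
    # One random step per pair of consecutive samples.
    steps = [rng.randint(1, max_step) for _ in range(num_samples - 1)]
    span = sum(steps)
    if span >= num_frames:
        raise ValueError("clip is too short for the drawn steps")
    # Choose a start so that the last index stays inside the clip.
    start = rng.randint(0, num_frames - 1 - span)
    return list(accumulate(steps, initial=start))


# Example: pick 16 frames from a 100-frame clip.
indices = nonuniform_frame_indices(100, 16, max_step=4, seed=0)
```

The returned indices can be used to select frames from a decoded video before passing the clip to a network. Drawing every step as 1 recovers ordinary consecutive sampling, which makes the augmentation easy to switch off for comparison.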

Similar resources

Towards Universal Representation for Unseen Action Recognition

Unseen Action Recognition (UAR) aims to recognise novel action categories without training examples. While previous methods focus on inner-dataset seen/unseen splits, this paper proposes a pipeline using a large-scale training source to achieve a Universal Representation (UR) that can generalise to a more realistic Cross-Dataset UAR (CDUAR) scenario. We first address UAR as a Generalised Multip...

Dynamic visual information facilitates object recognition from novel viewpoints.

Normally, people have difficulties recognizing objects from novel as compared to learned views, resulting in increased reaction times and errors. Recent studies showed, however, that this "view-dependency" can be reduced or even completely eliminated when novel views result from observer's movements instead of object movements. This observer movement benefit was previously attributed to extra-r...

Extrinsic cues aid shape recognition from novel viewpoints.

It has been shown previously that the visual recognition of shape is susceptible to the mismatch between the retinal input and its representation in long-term memory, especially when this mismatch arises from rotations in depth. One possibility is that the visual recognition system deals with such mismatch by a transformation of the input or the representation thereby bringing both into alignme...

Action Recognition in Semi-synthetic Images using Motion Primitives

This technical report describes an action recognition approach based on motion primitives. A few characteristic time instances are found in a sequence containing an action and the action is classified from these instances. The characteristic instances are defined solely on the human motion, hence motion primitives. The motion primitives are extracted by double difference images and represented ...

Adaptation of Pronunciation Dictionaries for Recognition of Unseen Languages

This paper studies the relative effectiveness of different methods for multilingual model combination and dictionary mapping for recognizing a new unseen target language if training data are limited. We examine the crosslanguage transfer from monolingual and multilingual models to German and Russian language for large vocabulary speech recognition using a dictation database which has been colle...

Journal

Journal title: International Journal of Computer Vision

Year: 2021

ISSN: 0920-5691, 1573-1405

DOI: https://doi.org/10.1007/s11263-021-01467-7